rank | frequency | n-gram (N=1) |
---|---|---|
1 | 6701 | -a |
2 | 5649 | -e |
3 | 4474 | -i |
4 | 4242 | -y |
5 | 2820 | -m |

rank | frequency | n-gram (N=2) |
---|---|---|
1 | 2570 | -ie |
2 | 2123 | -ch |
3 | 1373 | -ej |
4 | 1286 | -em |
5 | 1283 | -go |

rank | frequency | n-gram (N=3) |
---|---|---|
1 | 1319 | -nie |
2 | 1257 | -ego |
3 | 1242 | -ych |
4 | 899 | -nia |
5 | 616 | -ach |

rank | frequency | n-gram (N=4) |
---|---|---|
1 | 616 | -nych |
2 | 419 | -nego |
3 | 405 | -anie |
4 | 384 | -ania |
5 | 365 | -iego |

rank | frequency | n-gram (N=5) |
---|---|---|
1 | 314 | -kiego |
2 | 224 | -owych |
3 | 200 | -skiej |
4 | 178 | -owego |
5 | 171 | -wania |

The tables show the five most frequent letter N-grams at the end of words, one table for each N from 1 to 5, in that order. The procedure parallels 2.2.5 Most frequent word beginnings, except that the aim here is suffix detection rather than prefix detection.
The MySQL query for N=3 (for other values of N, change the length argument of both right() calls):
SELECT @pos:=(@pos+1), xx.* FROM (SELECT @pos:=0) r, (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word,3)) FROM words WHERE w_id>100 GROUP BY RIGHT(word,3) ORDER BY cnt DESC) xx LIMIT 5;
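
The same counting can be sketched outside the database for all N at once. The following is a minimal Python illustration, not the code used to produce the tables: the function name `top_endings` and the toy word list are assumptions made for this example, and the list stands in for the rows of the `words` table, with each entry counted once.

```python
from collections import Counter

def top_endings(words, n, k=5):
    """Return the k most frequent word-final letter n-grams as (ending, count) pairs."""
    # Words shorter than n are skipped here. MySQL's RIGHT(word, n) would
    # return the whole word instead, so counts can differ for short words.
    counts = Counter("-" + w[-n:] for w in words if len(w) >= n)
    return counts.most_common(k)

# Hypothetical toy word list, not taken from the corpus.
sample = ["dobrego", "nowego", "dobrych", "nowych", "czytanie", "pisanie", "tego"]

# One ranking per N, mirroring the five tables (N = 1..5).
for n in range(1, 6):
    print(n, top_endings(sample, n, k=3))
```

Ties are returned in the order in which the endings were first seen, so the ranking among equally frequent endings is not meaningful.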
2.2.5 Most frequent word beginnings